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ABSTRACT Galactose is important for the survival and virulence of bacteria. In Escherichia coli, galactose is utilized by the Leloir 
pathway, which is controlled by a complex network. To shed light on the potential functions the galactose network could per- 
form, we performed bioinformatical analysis of reference genome sequences belonging to the Enterobacteriaceae family. We 
found that several genomes have reduced numbers of components compared to the E. coli galactose system, suggesting that the 
network can be optimized for different environments. Typically, genes are removed by deletions; however, in Yersinia pestis, the 
galactose mutarotase (galM) gene is inactivated by a single-base-pair deletion. Lack of GalM activity indicates that the two ano- 
mers of D-galactose are used for different purposes, a-D-galactose as a carbon source and /3-D-galactose for induction of UDP- 
galactose synthesis for biosynthetic glycosylation. We demonstrate that activity of the galM gene can be restored by different 
single-base-pair insertions. During the evolution of Y. pestis to become a vector-transmitted systemic pathogen, many genes 
were converted to pseudogenes. It is not clear whether pseudogenes are present to maintain meiotrophism or are in the process 
of elimination. Our results suggest that the galM pseudogene has not been deleted because its reactivation may be beneficial in 
certain environments. 

IMPORTANCE Evolution of bacteria to populate a new environment necessarily involves reengineering of their molecular net- 
work. Members of the Enterobacteriaceae family of bacteria have diverse lifestyles and can function in a wide range of environ- 
ments. In this study we performed bioinformatical analysis of 34 reference genome sequences belonging to the Enterobacteri- 
aceae family to gain insight into the natural diversity of the D-galactose utilization network. Our bioinformatical analysis shows 
that in several species, some genes of the network are completely missing or are inactivated by large deletions. The only excep- 
tion is the galactose mutarotase (galM) gene of Yersinia pestis, which is converted to a pseudogene by a single-base-pair deletion. 
In this paper, we discuss the possible consequences of galM inactivation on network function. We suggest that galM was con- 
verted to a pseudogene rather than being deleted in evolution because its reactivation can be beneficial in certain environments. 
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Evolution of bacteria to populate a new environment neces- 
sarily involves reengineering of their molecular network. 
Changes can affect the elements (e.g., genes) of the network as 
well as the interactions in the network. Redundant genes or those 
that antagonize successful colonization in the new environment 
can be inactivated by point mutations or removed by deletions 
(1). Also, genes required for adaptation to a new niche can be 
acquired by horizontal transfer (e.g., pathogenicity islands) (2). A 
major determinant of network function is network logic, which 
depends on the interactions between network elements (3). Re- 
cent studies demonstrated that network logic can be easily engi- 
neered by mutations in regulatory sequences (4, 5). Rearrange- 
ments in metabolic pathways were analyzed in Yersinia pestis, 
which diverged recently from Yersinia pseudotuberculosis, a gas- 
trointestinal pathogen (6). Analysis of the Yersinia pestis metabolic 
network suggested excellent agreement between the possible met- 
abolic reaction pathways and the known nutritional needs of 
Y. pestis cells (1). It is generally assumed that cellular responses to 
the natural levels of perturbations in the concentration of critical 



chemical compounds are optimized. Along this logic, capabilities 
of regulatory networks would reflect the potential nature of envi- 
ronments and the environmental changes cells may face (7). 

Members of the Enterobacteriaceae family of bacteria have di- 
verse lifestyles and can function in a wide range of environments. 
In this study we performed bioinformatical analysis of 34 refer- 
ence genome sequences belonging to the Enterobacteriaceae family 
to gain insight into the natural diversity of the D-galactose utiliza- 
tion network. The rationale of using the D-galactose network is 
that D-galactose metabolism can be an important factor in viru- 
lence in different bacteria (8-10). For example, in Erwinia amylo- 
vora, the causal agent of fire blight of rosaceous plants, galactose 
metabolism affects capsule synthesis and virulence (9). In Y. pestis, 
both the D-galactose transport and metabolism operons are 
induced when the temperature is shifted from 26°C to 37°C, 
which corresponds to the temperature change in the transition 
from the flea to the mammalian host (11). Also, incorporation 
of j3-D-galactose into the lipopolysaccharide (LPS) of Y. pestis 
inside the transmitting vector (flea) may be important (12). 
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The D-galactose network of Escherichia coli is well characterized In this paper, we focus primarily on the role of galactose mutaro- 
(13) and suitable to be used as a reference for the comparative tasein the D-galactose utilization network. We show that the function 
analysis . of the galM pseudogene of Y. pestis can be restored by single-base-pair 

Escherichia coli utilizes D-galactose by the amphibolic Leloir insertions in different ways. Our analysis predicts that the two 
pathway (Fig. 1). The metabolic steps of D-galactose utilization D-galactose anomers, a- and j3-D-galactose, play different roles in 
are catalyzed by the GalK (galactokinase), GalT (galactose Y. pestis. Only a-D-galactose can be used as a carbon source, while 
1-phosphate uridylyltransferase), and GalE (UDP-galactose-4- j8-D-galactose can induce the production of UDP-galactose, a com- 
epimerase) proteins. In E. coli, the genes encoding these proteins pound used in biosynthetic glycosylation reactions, 
belong to the same operon, together with the galM gene, which 
encodes a mutarotase that allows the intracellular interconversion RESULTS 

of the two anomers of D-galactose (14), a-D-galactose and j3-D- Identification of elements of the D-galactose network in entero- 

galactose. Only a-D-galactose serves as a substrate of galactoki- bacteria. Because metabolic enzyme sequences are highly con- 
nase, the first enzyme of the Leloir pathway. D-Galactose can be served across species, metabolic networks can be successfully re- 
transported by two D-galactose-induced transport systems, the constructed primarily based on genome sequence data (24). 
low-affinity GalP and the high-affinity Mgl systems (15). Trans- Therefore, we used the BLAST program (25) to identify DNA and 
port and utilization of D-galactose is regulated by the Gal repressor protein sequences in reference genome sequences of strains be- 
(GalR) and the Gal isorepressor (GalS). These repressors bind the longing to the Enterobacteriaceae family, which are similar to corn- 
same set ofoperators in the regulatory regions of the genes belong- ponents of the D-galactose network of E. coli K-12 MG1655 
ing to the gal regulon in the absence of D-galactose. D-Galactose (Fig. 2). We found only four genes, galR (encoding the galactose 
binding to GalR (similar to that to GalS) allosterically inhibits its repressor), galE (encoding the UDP-galactose-4-epimerase), crp 
operator binding (13, 16, 17). A recent study demonstrated that (encoding the cyclic AMP [cAMP] receptor protein [CRP]), and 
both of these anomers of D-galactose are effective in the binding spf (encoding the Spot42 sRNA), which are present in all the ge- 
and inactivation of GalR (18). nomes studied. All 12 genes studied (galETKM, mglBAC, galR, 

In the Leloir pathway, only the galE gene product is involved in galS, galP, crp, and spf) are present in 19 of the 34 genomes ana- 
making substrates for biosynthetic glycosylation reactions, while lyzed. However, in six of these strains, the galE gene is not part of 
all of the Gal enzymes are needed for catabolism of D-galactose. the gal operon. Six strains have the high-affinity Mgl transport 
Therefore, when D-galactose is not available or not preferred as a system but not the low-affinity GalP symporter, suggesting that 
carbon source, expression of the gal operon genes is discoordi- the galactose utilization networks of these strains are optimized 
nated, resulting mostly in GalE synthesis ( 19-21). Enterobacteria for low-galactose environments (Dickeya dadantii, Dickeya zeae, 
evolved different mechanisms to allow such discoordination. Sodalis glossinidius, Yersinia enterocolitica, Y. pestis, and Y.pseudo- 
These include using small RNA (sRNA)-mediated translational tuberculosis strains). As opposed to those strains, Edwardsiella ic- 
regulation (22), differentially regulated promoters and natural taluri, Edwardsiella tarda, Erwinia pyrifoliae, Shigella boydii, and 
polarity (19-2 1,23), and physical separation of the galE gene from Shigella dysenteriae have only the low-affinity transporter, which 
the rest of the gal operon genes (23). requires high levels of extracellular D-galactose for proper func- 

tion. The three Pectobacterium strains and 
Proteus mirabilis lack both transport sys- 
tems. Genes responsible for galactose me- 
tabolism (galE, galK, galT, galM) can be 
found in different arrangements in the 
strains analyzed; however, there are two 
strains in which we found inactive com- 
ponents. In Sodalis glossinidius, only the 
galE gene is intact, suggesting that in these 
cells, UDP-galactose required for biosyn- 
thetic purposes is produced from 
UDP-glucose. In Y. pestis, the galM gene 
contains a single-nucleotide deletion, re- 
sulting in a frameshift. 

Our bioinformatical analysis shows 
that in most of the cases, genes are 
completely missing or are inactivated 
by large deletions. The only exception is 
the galM gene of Y. pestis, which is 
converted to a pseudogene. The same 
pseudogene was found in all the 
Y. pestis genome sequences (e.g., 
GenBank accession no. NC_010159.1, 
NC_009381.1, NC_008149.1, NC_008150.1, 
NC_005810.1, NC_003143.1, and NC_ 
004088.1). 




FIG 1 Regulation of proteins involved in D-galactose transport and utilization. Vast grey arrows 
indicate the major metabolic flows. This figure is based on the E. coli D-galactose network (13) but for 
simplicity shows only the gene products (proteins and sRNA) of the 12 genes involved in the bioinfor- 
matical analysis. Products of the four genes which are present in all 34 genomes studied are typed in 
black, and the others which are missing from one or more genomes are shown in grey. Red lines indicate 
inhibitions, while green lines indicate enhancements. Proteins which often belong to the same operon 
are shown in black boxes. 
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FIG 2 Presence of genes responsible for D-galactose transport {galP, mglBAC), utilization (galETKM), and regulation of these two processes (gaIR, galS, crp, spf) 
in genomes of strains belonging to the Enterobacteriaceae family (GenBank accession numbers are shown on the left, and more details are provided in Materials 
and Methods). Filled boxes indicate the presence of intact genes, based on bioinformatical analysis. Empty boxes indicate genes disrupted by extensive deletions 
or insertions. The galM pseudogene of Y. pestis, which is inactivated by a single-base-pair deletion, is marked by an asterisk. Arrowheads indicate the direction 
of transcription for putative operons and ORFs separated by less than 1 kbp. 



Functional restoration of the galM pseudogene of Y. pestis. 

Comparison of the Y. pestis and Y. pseudotuberculosis genomes 
indicated that the transition from the enteropathogen form to the 
vector-transmitted systemic pathogen form resulted in more than 



300 pseudogenes (26, 27). However, it remains to be answered 
whether pseudogenes are present to maintain meiotrophism or 
are in the process of elimination. The Y. pestis galM pseudogene 
differs from the Y. pseudotuberculosis YPIII galM gene only by a 
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single -base-pair deletion (at nucleotide position 117 of the open 
reading frame [ORF] ) and a same-sense mutation (at nucleotide 
position 1 80) . Therefore, this pseudogene can likely be reactivated 
by insertion of a single base pair. The GalM protein that would be 
encoded by the "repaired" Y. pestis galM gene is identical to the 
Y. pseudotuberculosis YPIII GalM protein (GenBank accession no. 
YP_00 172 1673.1). Interestingly, the Yersinia pseudotuberculosis 
IP 32953 galM gene encodes a different amino acid at the position 
of the required insertion (D instead of N), suggesting that the 
Y. pestis pseudogene can be restored in multiple ways. To test this 
hypothesis we synthesized the Y. pestis galM pseudogene and its 
different "restored" versions and inserted them into a low-copy- 
number plasmid. The plasmid-borne genes were tested whether 
they can complement a chromosomal galM deletion in E. coli. 
GalM function was tested by studying growth on intracellularly 
produced j3-D-galactose. Cells were grown in the presence of 
phenyl-jS-D-galactopyranoside as the sole carbon source. Hydro- 
lysis of phenyl-/3-D-galactopyranoside by /3-galactosidase gener- 
ates phenol and j3-D-galactose. Phenol is excreted into the culture 
medium and does not accumulate in the cell. Cells were grown 
overnight in LB medium and then washed in M63 minimal me- 
dium. Washed cells were plated on M63 minimal medium supple- 
mented with 2 mM phenyl- j3-D-galactopyranoside, 0.0004% vita- 
min Bj, 0.5 mM IPTG (isopropyl-thiogalactoside; to induce 
j8-galactosidase production), and 15 /xg/ml tetracycline. We 
found that growth strongly depends on the presence of functional 
galM. Cells containing the Y. pestis galM pseudogene showed very 
slow growth, similar to that of the E. coli AgalM cells harboring the 
empty plasmid. This slow growth reflects the low level of nonen- 
zymatic mutarotation of the D-galactose anomers. However, two 
"restored" versions of GalM (N39, D39) conferred similar growth 
to wild- type E. coli MG1655 cells (Fig. 3). The difference in the size 
of colonies reflects a minimum doubling time of 135 to 150 min 
for the galM + and about 1,000 min for the AgalM strains, as mea- 
sured in liquid cultures. GalM version Q39 grew similarly to 
galM + , 139 and T39 showed intermediate growth, while K39, R39, 
and E39 were not growing substantially faster than AgalM. 

Effect of galM and galP deletion on utilization of extracellu- 
lar D-galactose in vivo. Y. pestis strains have only one of the 
D-galactose-induced transport systems. The three genes of the 
high-affinity Mgl ( /3-methylgalactoside) transport system are part 
of a single operon, similar to E. coli (mglBAC). However, Y. pestis 
strains lack the low-affinity GalP transporter. To simulate the 
function of the reduced network lacking both the low-affinity 
transporter and the galactose mutarotase, we created a AgalM 
AgalP double mutant of E. coli MG1655. We compared the growth 
of this double mutant with wild-type and single mutant strains 
(Fig. 4) in minimal D-galactose medium. We found that deletion 
of galM increases the time needed to reach stationary phase by 
about 100 minutes, while deletion of galP increases it by about 
200 minutes, compared to the time needed by the wild-type strain. 
As expected, in the logarithmic phase, the AgalM mutant shows a 
growth rate similar to that of the wild-type strain (14). However, 
the double mutant shows growth similar to that of the AgalP 
strain, suggesting that growth is limited by the lack of the low- 
affinity transporter and not by the low rate of spontaneous 
D-galactose mutarotation. Because intracellular mutarotation of 
D-galactose is inefficient in the absence of GalM (14), our results 
suggest that besides /3-D-galactose, the Mgl transport system can 
efficiently transport a-D-galactose as well; therefore, the presence 




r 



FIG 3 Complementation of a chromosomal galM mutation (AgalM) in£. coli 
by two restored versions of the Y. pestis galM pseudogene carried on a low- 
copy- number plasmid. Cells containing a functional copy of galM grow faster 
on intracellularly produced /3-D-galactose as the sole carbon source. Hydroly- 
sis of phenyl-)3-D-galactopyranoside by /3-galactosidase generates phenol and 
/3-D-galactose. Phenol is excreted into the culture medium and does not accu- 
mulate in the cell. MG1655Aga/M cells containing plasmid pLG338E carrying 
the Y. pestis galM pseudogene (C) and its two restored versions, N39 (D) and 
D39 (E), were plated on M63 minimal medium containing 2 mM phenyl-/3- 
D-galactopyranoside and also 0.5 mM IPTG to induce j3-galactosidase produc- 
tion. E. coli MG1655/pLG338E (A) and E. coli MG1655Agn/M/pLG338E (B) 
were plated as controls for comparison. 

of mutarotase does not substantially increase the rate of 
D-galactose utilization. Also, because j3-D-galactose inactivates 
GalR similarly to a-D-galactose (18, 28), the unused /3-D-galactose 
pool in the cell may result in higher derepression of the gal regulon 
genes; therefore, it allows higher rates of transport and utilization, 
which can compensate for the lack of /3-D-galactose utilization in 
the double mutant. 
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FIG 4 Effect of galM and galP deletion on growth on D-galactose as a carbon 
source. We compared the growth of the \galM AgalP double mutant ( + ) with 
those of the wild-type (■), AgalM (O), and AgalP (*) strains in minimal 
D-galactose medium. Cells were grown overnight in LB medium and diluted in 
M63 minimal medium supplemented with 0.0004% vitamin Bj and 0.3% 
D-galactose. OD 600 , optical density at 600 nm. 



Prediction of regulatory links in the Y. pestis galactose net- 
work by sequence analysis. By searching for genes similar to the 
regulator genes of the E. coli D-galactose system, we could confirm 
the presence of galR, crp (encoding the cAMP receptor protein), 
and spf (encoding the Spot42 small RNA). The amino acid se- 
quences of the GalR proteins in Y. pestis strains are about 80% 
identical to those of the E. coli GalR protein. The DNA recognition 
helices are identical, suggesting that the same sequences are rec- 
ognized by the Y. pestis and E. coli GalR proteins. However, the 
galS repressor gene was not found in Y. pestis strains. In order to 
predict regulatory links in the Y. pestis D-galactose network, we 
compared the regulatory regions of the Y. pestis galETK, mglBAC, 
and galR genes with the corresponding regions in E. coli. We found 
that the binding sites of the regulatory proteins in the control 
region of the galETK operon are arranged in a way similar to that 
found in E. coli. There are two GalR binding sites (operators); 
however, the spacing between the two operators is 1 bp shorter 
than that in E. coli. Comparison of the promoter elements sug- 
gested that the Pl ga i B promoter is functional but that the P2 gam 
promoter is significantly weaker than that in E. coli. In Y. pestis 
strains, the sequence found at the position of the extended —10 
element of the E. coli P2 galE promoter (TTTGTTATGCT) is AAT 
GGCGTGCT, which is not similar to the consensus — 10 element. 

Based on the sequence similarities, the galR gene is assumed to 
be autoregulated in the same way as that in E. coli (13). The regu- 
latory region and promoter of the mglBAC operon in Y. pestis 
strains are also highly similar to those of E. coli; however, the 5' 
untranslated region of the mglBAC mRNA (about 250 bp in both 
species) shows no similarity. The sequence found in Y. pestis is 
conserved in Y. pestis and Y. pseudotuberculosis strains. A similar 5' 
untranslated region of the mglBAC mRNA is found in Yersinia 
enterocolitica, but we could not find any other similar sequences in 
the database. 
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FIG 5 Transcription regulation of the Y. pestis galETK operon. In vitro tran- 
scription assays were performed on plasmid templates containing the regula- 
tory region of the gal operon from E. coli (A) and from Y. pestis (B). Lane 2 
shows the effect of GalR™, while lane 3 shows the combined effect of GalR YP 
and HU. Protein concentrations are indicated below the lanes. The RNA1 
transcript, which does not vary with the GalR concentration, was used as an 
internal control between lanes. 



Transcription of the Y. pestis galETK operon is initiated 
from a single promoter. In order to test the predictions of se- 
quence analysis, we studied the regulation of the Y. pestis galETK 
operon in a purified system. We performed in vitro transcription 
assays using E. coli RNA polymerase (a 70 ), purified Y. pestis GalR 
(GalR YP ), and plasmid templates containing the regulatory region 
of the Y. pestis galETK operon. We used a similar plasmid con- 
struct containing the corresponding regulatory region from E. coli 
(pSA850) (29) for comparison (Fig. 5). We found that the GalR YP 
protein can regulate the E. coli galETKM operon the same way as 
E. coli GalR. In the presence of GalR YP , the level of P2 galE transcrip- 
tion is increased, while Pl ga i E transcription is repressed. When 
both GalR YP and HU (from E. coli) are present, both promoters 
are repressed by DNA looping (30) (Fig. 5). However, unlike in 
E. coli, transcription of the Y. pestis galETK operon is initiated only 
from a single site, corresponding to the start site of the Pl ga m 
promoter in E. coli. GalR YP repressed this promoter equally in 
both the presence and absence of HU. Similar to the Pl gam pro- 
moter of E. coli, cAMP-CRP strongly activated the Y. pestis gal 
promoter, and GalR reduced the effect of cAMP-CRP (Fig. 6). 
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FIG 6 Combined effect of GalR and CRP on transcription of the Y. pestis gal 
operon promoter (lanes 5 to 8). Regulation of the£. coli gal operon promoters 
is shown for comparison (lanes 1 to 4). Protein concentrations are indicated on 
top. The RNA1 transcript, which does not vary with the GalR, HU, or CRP 
concentrations, was used as an internal control between lanes. 
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FIG 7 Predicted D-galactose network in Y. pestis. Vast grey arrows show major metabolic flows (Leloir 
pathway). Grey ellipses designate intracellular D-galactose and galactoside pools containing a- and 
j3-D-galactose anomers. Red lines indicate transcriptional regulation. The blue line indicates Spot42 
RNA-mediated translational control. Other enhancements and inhibitions are indicated by green lines. 



DISCUSSION 

Functional diversity of the D-galactose network in enterobacte- 
ria. The functional role of different genes in the D-galactose utili- 
zation network of E. coli has been studied extensively. Therefore, 
based on the set of genes present in a certain reduced network, it is 
possible to formulate predictions about the functional conse- 
quences of reduction. Functional diversity of the reduced net- 
works affects both transport and metabolism. Some networks 
contain only the low-affinity transporter (5), while others have 
only the high-affinity transport system (6), suggesting optimal 
performance in high-galactose and low-galactose environments, 
respectively. In four cases, we found that both of these transport- 
ers are missing; therefore, the function of the galactose system is 
limited to endogenous inducer synthesis (31) and metabolism of 
intracellular D-galactose obtained from galactose-containing 
compounds. Galactose metabolism is affected in two cases, as fol- 
lows. In the S. glossinidius network, only the galE gene of the gal 
operon is functional, indicating that this network is incapable of 
amphibolic utilization of D-galactose. In Y. pestis, the last gene of 
the gal operon, galM, is inactivated by a single-base-pair deletion. 

D-Galactose utilization in Y. pestis. To understand the conse- 
quences of galM inactivation, we identified elements and interac- 
tions in the D-galactose network of Y. pestis by combining exper- 
imental results with the results of bioinformatical analyses (Fig. 7). 
The Y. pestis network is less complex than the E. coli D-galactose 
network. Both the number of elements and the number of inter- 
action links are reduced. The Y. pestis D-galactose system is regu- 
lated by a single D-galactose-responsive regulator, which is inter- 
changeable with E. coli GalR in in vitro experiments. D-Galactose 
transport of Y. pestis strains is also simplified, having only the 
high-affinity Mgl transport system, which can transport 
D-glucose, D-galactose, and j3-methylgalactosides in E. coli. 

Our results suggest that the Y. pestis D-galactose network, un- 



like that in E. coli, can utilize a-D- 
galactose but not j3-D-galactose. Based on 
the comparative network analysis pre- 
sented in this work, we can formulate pre- 
dictions about the potential D-galactose- 
related environments and the function of 
the D-galactose network in such environ- 
ments. Our results show that the simpli- 
fied network found in Y. pestis is less com- 
petitive in utilizing large extracellular 
D-galactose pools; however, it can use 
D-galactose when present at a constant 
low level due to the presence of the high- 
affinity Mgl transport system. This sug- 
gests that niches occupied by Y. pestis 
(e.g., arthropod vector, macrophages, 
and human blood) (32) are generally 
poor in D-galactose. Also, from the ab- 
sence of galS we can conclude that extra- 
cellular D-galactose levels are less dynamic 
in these niches than in the niches occu- 
pied by E. coli (33). 

The D-galactose system is also involved 
in the utilization of intracellular 
D-galactose obtained from the degrada- 
tion of oligosaccharides (e.g., lactose and 
melibiose) (14). Because galM is inacti- 
vated in the Y. pestis network, intracellular degradation of a-D- 
galactose- and /3-D-galactose-containing compounds could have 
different effects. The system can utilize intracellular a-D-galactose 
as a carbon source and also to provide building blocks for biosyn- 
thetic glycosylation. For example, unlike E. coli, Y. pestis has a 
galactan transport and utilization system which can degrade ex- 
tracellular galactan into smaller oligomers that are transported 
and processed to a-D-galactose oligomers inside the cell (34). The 
Mgl transport system can transport j3-D-galactosides (e.g., 
methyl-j3-D-galactoside and D-glycerol-/3-D-galactoside) (35). 
However, /3-D-galactose obtained from intracellular degradation 
of such compounds could induce the gal regulon genes but would 
not be utilized by the system. Induction of the galETK operon by a 
nonmetabolized inducer in the absence of a-D-galactose can serve 
biosynthetic purposes because GalE can catalyze the production 
of UDP-galactose (from UDP-glucose), which is required for bio- 
synthetic glycosylation (Fig. 7). Such metabolic flow control in the 
amphibolic D-galactose pathway is common in enterobacteria. 
Synthesis of D-galactose-containing polysaccharides (e.g., in LPS) 
is often required for pathogenesis (8-10), and the need for UDP- 
galactose is independent of D-galactose availability. Different 
mechanisms have been reported so far, which take advantage of 
the fact that only the galE gene product is involved in making 
substrates for biosynthetic glycosylation reactions, and all of the 
gal enzymes are needed for catabolism of the sugar D-galactose. A 
common mechanism in enterobacteria is discoordinated expres- 
sion of the gal operon genes. This can be achieved by using a small 
regulatory RNA (Spot42) which does not affect translation of GalE 
and GalT but blocks GalK production (22) or by transcribing the 
gal operon from two different promoters, one of which transcribes 
mostly the first gene of the operon, galE, because of natural polar- 
ity (19-21, 23). In certain pathogenic enterobacteria, the galE gene 
is physically separated from the other gal operon genes, and its 
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expression is independent of intracellular D-galactose levels (9). 
Intracellular synthesis of a nonmetabolized inducer constitutes a 
novel mechanism for regulation of the amphibolic D-galactose 
pathway. The observation that utilization of intracellular j3-D- 
galactose by the Y. pestis network can be turned on by single-base- 
pair insertions in galM suggests that the galM pseudogene has not 
been deleted in evolution because its reactivation is beneficial in 
certain environments. 

MATERIALS AND METHODS 

Bacterial strains. Genome sequences of 34 strains were involved in the 
bioinformatical analysis. The strains used are the following: Citrobacter 
koseri ATCC BAA-895, Citrobacter rodentium ICC168, Cronobacter saka- 
zakii ATCC BAA-894, Cronobacter turicensis z3032, Dickeya dadantii 
3937, Dickeya zeae Ech 1591, Edwardsiella ictaluri 93-146, Edwardsielia 
tarda EIB202, Enterobacter cloacae subsp. cloacae ATCC 1 3047, Enterobac- 
ter sp. strain 638, Erwinia amylovora CFBP1430, Erwinia billingiae Eb661, 
Erwinia pyrifoliae Epl/96, Erwinia tasmaniensis Etl/99, Escherichia coli 
MG1655, Escherichia jergusonii ATCC 35469, Klebsiella pneumoniae 342, 
Klebsiella variicola At-22, Pantoea ananatis LMG 20103, Pantoea vagans 
C9-1, Pectobacterium atrosepticum SCRI1043, Pectobacterium carotovo- 
rum PCI, Pectobacterium wasabiae WPP163, Proteus mirabilis HI4320, 
Salmonella enterica serovar Typhimurium strain LT2, Serratia pro- 
teamaculans 568, Shigella boydii CDC3083-94, Shigella dysenteriae Sdl97, 
Shigella flexneri 5 strain 8401, Shigella sonnei Ss046, Sodalis glossinidius 
strain "morsitans," Yersinia enterocolitica subsp. enterocolitica 8081, Yer- 
sinia pestis KIM10, and Yersinia pseudotuberculosis IP32953. 

E. coli MG1655Aga/M87::Kan r was described by Bouffard et al. (14). 
MG1655Agfl/P::cm r and MG1655Aga/M::Kan r Aga/P::cm r are derivatives 
of E. coli MG1655 and MG1655Aga/M87::Kan r , respectively. The chlor- 
amphenicol resistance cassette was inserted into the galP gene by recom- 
bineering (36). The cat gene from plasmid pRFB122 (37) was amplified 
using the primers GalPupcat (5'-CTCACCTATCTAATTCACAATAAAA 
AATAACCATATTGGAGGGCATCAAAATGAGACGTTGATCGGCAC 
GTAAGA-3') and GalPdowncat (5' -CCTCCGCGATGGGAGGAAGCT 
TGGGGAGATTAATCGTGAGCGCCTATTTCTTACGCCCCGCCCTG 
CCACTCATCGCAG-3'). Recombineering was performed according to 
the protocol described by Datsenko and Wanner (38). 

DNA manipulation methods. Bacterial growth conditions and plas- 
mid manipulations followed the protocols described by Sambrook and 
Russell (39). Transformations were performed with chemically compe- 
tent XLl-Blue cells (Stratagene). Restriction endonucleases and DNA oli- 
gonucleotide primers were purchased from Invitrogen, PCR (GeneAmp 
XL) and sequencing (ABI Prism) kits from Applied Biosystems, and DNA 
purification kits from Qiagen. DNA sequencing reactions were analyzed 
in a PerkinElmer/Applied Biosystems (model 373 A) automated se- 
quencer. 

Plasmid construction. Plasmid pSA850YP was made by inserting 
the Y. pestis galETK regulatory region between the EcoRI and PstI sites of 
plasmid pSA850 (29). Because of safety considerations, the DNA frag- 
ment corresponding to the chromosomal region 3350221 to 3350532 of 
Y. pestis KIM (GenBank accession no. NCJ)04088.1) was amplified 
from Yersinia pseudotuberculosis ATCC 23207, which contains an identi- 
cal sequence. The primers used for amplification were YPP1 (5'- 
AAAAGAATTCGCGCACCACAAACAGGACATTCC) and YPP2 (5'- 
AAAACTGCAGCAATTGCACACAGGTATGGCTACC). The sequence 
of the Y. pestis galETK regulatory region in plasmid pSA850YP was 
verified. 

The plasmid pSEM1026YP for the expression of Y. pestis GalR was 
created by amplifying the galR gene from Yersinia pseudotuberculosis 
ATCC 23207 using primers YPGNCO ( 5 ' - AAAACCATGGCCACTA 
TAAAGGATGTTGCCAAGCT) and YPGCTBI ( 5 ' - AAAAGGATCCAT 
CAGTGTCATCCCGTAGGCTTGGC) and inserting it between the Ncol 
and BamHI sites of plasmid pSEM1026 (40). The cloned Y. pseudotuber- 



culosis galR gene is 99% identical to the sequence of Y. pestis galR, while the 
amino acid sequences of the encoded proteins are identical. 

The galM pseudogene of Y. pestis (ATCC BAA- 1504) was PCR ampli- 
fied from a genomic DNA preparation (purchased from ATCC) using the 
primers YP_Mup_XhoI (5' -TTTTCTCGAGGTTCGCACCACCGTTGC 
GCAAGAATAC-3') and YP_Mdn_Acc65I (5'-TTTTGGTACCATCATT 
CATAACGTCATCATTCATAAC-3 ' ) . The resulting DNA fragment was 
digested with Xhol and Acc65I and inserted between the Xhol and 
Acc65I sites of the low-copy-number plasmid pLG338E, which was de- 
rived from pLG338 (41) by eliminating the EcoRI site. The open reading 
frame ofgalM was restored by 8 different base pair insertions, using PCR 
mutagenesis with four designed primers and the previously used 
YP_Mdn_Acc65I primer. The designed primers are Yps_D (5'-TTTTGA 
ATTCACCAAATTGCAGAATAAAAGCGGTATGACCGTCACCTTT 
ATGGATTGGGGGGCAACCTGGTTATCGGCCATAT-3 ' ) , Yps_EKQ 
( 5 ' -TTTTGAATTCACCAAATTGCAGAATAAAAGCGGTATGACCGT 
CACCTTTATGVAATGGGGGGCAACCTGGTTATCGGCCATAT-3'), 
Yps_N ( 5 ' -TTTTGAATTCACCAAATTGCAGAATAAAAGCGGTATG 
ACCGTCACCTTTATGAACTGGGGGGCAACCTGGTTATCGGCCA 
TAT-3'), and YpsJTR ( 5 ' -TTTTGAATTCACCAAATTGCAGAAT 
AAAAGCGGTATGACCGTCACCTTTATGABATGGGGGGC 
AACCTGGTTATCGGCCATATTACCGCTGAAAAAT-3'). The ampli- 
fied fragments were used to replace the EcoRI-Acc65I fragment in the 
pLG338 plasmid containing the galM pseudogene. The DNA sequence of 
the inserted fragment was verified in all the constructs created. 

Protein purification. Expression and purification of the hexa- 
histidine-tagged Y. pestis GalR followed the protocol described before for 
E. coli GalR purification (40). HU protein was purified according to the 
method described by Aid et al. (42). CRP was purified as described by Ryu 
et al. (43). 

In vitro transcription. Transcription reactions were performed as 
described previously (16). The reaction mixture (50 /oil) contained 20 mM 
Tris acetate at pH 7.8, 10 mM magnesium acetate, 200 mM potassium 
glutamate, 100 /aM cAMP, and 2 nM supercoiled plasmid DNA template. 
GalR and CRP concentrations are as indicated in Fig. 5 and 6. HU was 
used at an 80 nM concentration when present. Twenty nanomolar RNA 
polymerase (USB) was added before incubation of the reactions at 37°C 
for 5 minutes. Transcription was started by the addition of 1 .0 mM ATP, 
0.1 mM GTP, 0.1 mM CTP, 0.01 mM UTP, and 5 uCi of [a- 32 P]UTP 
(3,000 Ci/mmol). Reactions were terminated after 10 minutes by addition 
of an equal volume of transcription loading buffer (0.025% bromophenol 
blue, 0.025% xylene cyanol, 0.01 M EDTA, and 90% deionized forma- 
mide). After heating at 90°C for 3 minutes, the samples were loaded onto 
7% polyacrylamide-urea DNA sequencing gels. RNA bands were quanti- 
fied using the ImageQuant Phosphorlmager (Molecular Dynamics, CA) . 
Band intensities were corrected by the background and normalized to the 
RNA1 band intensities of the corresponding lanes. 
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